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(54) DNA synthesis 

(57) A method of synthesizing long chain DNA, in which blocks having 4 to 8 base sequences are purely chemically 
ligated by a so-called solid phase method (triester method) using aminated CPG as a carrier. 



The drawings originally filed were informal and the print here reproduced is taken from a laterfiled formal copy. 
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SPECIFICATION 

A method of synthesizing long chain DMA 

5 The present invention relates to a novel method of synthesizing long chain DNA carrying information for 
synthesis of specific proteins and, more particularly, it relates to a method of synthesizing long chain DNA 
purely chemically, i.e. without the use of enzymes. 

It has been known that the synthesis of polypeptides by a gene technological means using synthetic gene 
is possible by steps of (1 ) synthesis of structural gene; (2) recombination of the gene into suitable plasmid; 

10 (3) transformation of suitable host by the formed chimera plasmid; and (4) obtaining of the desired 
polypeptide by culturing the transformed substance. 

Recently, developments of DNA probe is attracting public attention as a novel means for gene technology. 
This is a a method of identifying unknown DNA and RNA which is a transcribed product by a hybridization of 
single stranded DNA and RNA which are known in the art by utilizing properties of DNA and RNA that they 

15 form duplex by selecting complimentary substance just like the relation of template and casting. Since very 15 
sensitive and prompt identification is possible by utilizing the hybridization method, said method can be 
applied for diagnosis of precise name of disease by finding specific DNA and RNA in a gene level from blood 
and cells of patients and pathogenic bacteria. Accordingly, DNA has important value as diagnostic agent by 
its utilization as DNA probe. 

20 With reference to the above-given DNA as structural gene and that as a source of utilization as DN A probe, 20 
it has been known in terms of its nature that the longer the base sequence, the more important as 
information source and the wider in utilization range to DNA probe. However it has been also known that the 
longer the base sequence, the more difficult in its synthesis. 
Consequently, development of technique in synthesizing long chain DNA by easy manner has been 

25 desired. 25 
Conventional method for synthesizing DNA is as follows. Thus, first, comparatively short DNA fragment 
with 10 to 20 basic residues is chemically synthesized, then they are combined to prepare fragments having 
total structure of double stranded DNA exhibiting information on desired peptide synthesis, and finally they 
are combined using an enzyme called DNA ligase. 

30 However, by such a method, only comparatively short fragments with 1 (monomer), 2 (dimer) or 3 (trimer) 30 
bases are manufactured prior to block condensation and it is not possible to synthesize long DNA with 80 
residues or the like. 

In addition, in said method, it is essential to use an enzyme called DNA ligase. Therefore, in synthesizing 
double stranded DNA as gene, it is necessary that all base sequences constituting double stranded DNA are 
35 synthesized at one time. Accordingly, the above method Is not so effective in a process of synthesizinp 3c 
double stranded DNA. 

The present inventors have continued studies in order to overcome the above technical difficulty and have 
succeeded in synthesizing DNA with 46 bases or so by utilizing a method called a triester method (among the 
so-called solid methods) in which 1% polystyrene is used as a support and the compounds of 4 (tetramer) or 
40 5 (pentamer) bases are subjected to a repeated condensation. 40 

Even by such a method, however, the base numbers in the resulting DNA are 50 at the largest and there is 
still a difficulty In synthesizing DNA with chains of as long as 80 to 1 50 residues, 

(Problems that the Present invention Soives) 
45 In view of the above, the present inventors have further carried out continued studies paying their 45 
attention to (1 ) the synthesis of long chain DNA carrying as much as gene information and (2) the synthesis 
under more advantageous conditions and finally achieved the present invention. 
According to the present invention there is provided:- 

A method of synthesizing long chain DNA, characterised in that, blocks having 4 to 8 base sequences are 
50 purely chemically ligated by a so-called solid phase method (triester method) using aminated controlled 50 
pore glass as a carrier. 
The present invention will be further illustrated as hereunder:- 

Each block prior to the condensation can be obtained by the conventional way in which each base is 
subjected to a liquid phase synthesis. 

55 Aminated CPG (controlled pore glass) (cf. Tetrahedron, 24, 747-750, 1983) used in the present invention is 55 
used as a carrier in the solid phase method. To the amino group of this substance is combined 
deoxythymidine which is changed to 3'-sucinate by usual method. This is used as a carrier for nude side. 
Each desired block is extended, on this resin, to the direction of 5'-termina! successively. As to condensation 
agent, mesityien sulf nyl-3-nitrotriaz Nde(MSNT) can be used, for example. The her by resulting DNA is 

60 single stranded and th complimentary strand DNA which is necessary f r preparation of duplet DND can be 60 
asily obtained by the similar way. Or such duplet DNA can be very asily obtained by the use of DND 
polymerase using short fragment (10 b.p. or so) which is complimentary with 3"-terminal region of the 
resulting single stranded DNA. The fact that DNA polymerase can be used Is on of the most advantag ous 
merits of th present invention that the condensation reaction can b accomplished without the aid of DNA 

55 ligase which has been widely used in conventional meth ds. 55 
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The resulting duplet DNA is combined to give vector plasmid by the known method, then transformed to 
bacteria such as Escherichia coli, and the strain is cultured to afford desired polypeptid . In the above steps, 
various gene technological means which have been already established can be applied. 
It is possible in accordance with the present invention to synthesize DNA with as long as 80 to 150 residues 
5 and, therefore, polypeptides with 1 5 to 30 amino acids can be synthesized by the known gene technological 5 
means. For instance, the following polypeptides can be synthesized. They are growth hormone-release 
inhibiting factor (Somatostatin, containing 14 amino acids), stomach acid secreting stimulant (Gastrin, 
containing 17 amino acids), duodenum ulcer remedy (Secretin, containing 27 amino acids), stimulant for 
secretion of growth hormone, insuline and blood sugar level increase (Glucagon, containing 29 amino 
10 acids), morphine like agent (beta-Endorphin, containing 31 amino acids), and hypercalcemia remedy 10 
(Calcitonin, containing 32 amino acids), and the like. 

In addition, the long chain DNA of the present invention is applied not only for DNA base sequences of 
structural gene parts but also for the manufacture of general DNA including regulatory sites and specific 
sequences as well as for long chain DNA probe recognizing their structures. Accordingly the present 
1 5 Invention can be positively applied for development of diagnostic agents. 1 5 

[Effect of the invention) 

According to the present invention, long chain DNA can be synthesized simply and in large quantities. The 
long chain DNA of the present invention can be effectively utilized as (1 ) gene information source concerning 
20 polypeptide synthesis and (2) a sou rce for application of DNA probe in view of gene technology. 20 

Production of DNA has been 0.1 OD (1 OD is equivalent to about 50 micrograms) per one lot at best 
However, in accordance with the present invention, it is now possible to manufacture in quantities as large 
as 30 to 50 OD per lot. Consequently, expansion of utilizable field of long chain DNA as a gene and as a DNA 
probe can be expected. 

25 25 
(Examples) 

The present invention is further illustrated by giving examples concerning synthesis of endorphin whose 
physiological activities such as central nervous analgesic action and endocrine hormone action have been 
known. 

30 30 

(1 ) Synthesis of each block constituting base sequences including endorphin gene. 
Amino acid sequence of endorphins has been known and the DNA base sequence corresponding thereto 

can be freely selected by referring to a table of coden usage. They are given as hereunder together with their 

relation between each block constituting DNA base sequences used In the present invention. The upper, 
35 middle and lower columns are each block (figures therein are block numbers), base sequence and 35 

corresponding amino acid sequence, respectively. Incidentally, restricted enzyme sites are given at both 

terminals of DNA base sequences. Said sites are used in inserting plasmid. 

CD a -Endorphin 

40 40 

« 13 m 12 * 11 « 10 > 

5' ACCTGCAGCC CGT CGC TAC GGT GGT TTC ATG 
I Pit! Arg Arg TV? Giy Gly Phe Met 

45 45 

• 9 » 8 » 7 * 6 — 

ACT TCT GAG AAG TCT CAA ACT CCA TTG GTG 
Thr Ser Glu Lys Ser Gin Thr Pro Leu Val 
2 

50 50 

2 1 

5—*— 4 M 3 N * > 

ACT TAA TAG GGCTGCAGGT .3' 
Thr STOP STOP Pstl ' 
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<2> a- [Leu 5 ] -Endorphin 

« 13 16 " 15 * 10— 

5' , ACCTGCAGCC ATG TAC GGT GGT TTC TTG 
1 Pstl Met Tyr Gly Gly Phe Leu" 



« 9 * 8 * 7 * 6 — >— 

ACT TCT GAG AAG TCT CAA ACT CCA TTG GTG 
10 Thr Ser Glu Lys Ser Gin Thr Pro Leu Val 



1 



20 ® 7- [Leu 6 1- Endorphin 

* 1 3 — ' 16 15- ^ 10— 

5' | ACCTGCAGCC ATG TAC GGT GGT TTC TTG 
1 Pstl Met Tyr Gly Gly Phe Leu" 



25 



30 



-9 * 8 * 7 * 6* 



ACT TCT GAG AAG TCT CAA ACT CCA TTG GTG 
Thr Ser Giu Lys Ser Gin Thr Pro Leu Val 



5- 



ACT TTG TAG GGCTGCAGGT I 3' 
Thr Leu STOP Pstl ' 



35 



• < 13 » 12 * 11 * 10 ► 

5 r f ACCTGCAGCC CGT CGC TAC GGT GGT TTC ATG 
40 1 Pstl Arg Arg Tyr Gly Gly Phe Met 



-6- 



ACT TCT GAG AAG TCT CAA ACT CCA TTG GTG 
45 Thr Ser Glu Lys Ser Gin Thr Pro Leu Val 



2 1> 

5-*- 17-*- 3 **— * — > 

50 ACT TTG TAG GGCTGCAGGT, 3' 
Thr Leu STOP Pstl 1 



10 



15 ACT TAA TAG GGCTGCAGGT | 3' 

Thr STOP STOP Pstl ' 15 



20 



25 



30 



9 7- Endorphin 35 



40 



45 



50 
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Among the bl cks c nstttuting the above endorphin genes, the block7 was synthesized by the steps as 
given below. 



10 



15 



20 



25 



30 



bzbz ( I ) 
d (DttTr) AeAecE 



TEA 



bzbz (II) 
d (DHTr) AeCecE 



bzbz 
d (DHTr) AeAeo- 
(V) 



BSA 



bzbz 
dAeCece 

' (VI) 



bz ( m ) 
(DHTr) T B C B cE 

(IV) 

TEA bzbz 
d (DHTr) CeAece 



HSHT 



bzbzbzbz 
d (DHTr) AeAeAeCecs 

. (K) 

TEA 



bz 

(DHTr) TeCeo- 
(\1) 



bzbzbzbz 
d (DHTr) AeAeA*Ceo- 
(XI) 



| HSfiT 



BSA 



bzbz 
dCeAe ce 
— 1 (W) 



bzbzbz 
d (DHTr) TeCeCeAecE 
I . (X) 
1 BSA 
bzbzbz 



f (X m 



HS3T 

1 (X in) 

bzbzbzbz' bzbzbz 

d (DHTr) AeAeAeC2T E CeCeAecE 

TEA 

■" ( X IV ) 
bzbzbzbz bzbzbz 

d (DHTr) AeAeAeCaTeCeCe'Aeo- — 
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20 



25 



30 



-Block 7 



35 



40 



DHTr: 
bz. 
A " 
bz. 
C * 
T: 

E ! 
BSA: 
TEA: 



4 , 4 1 -Dime tboxy nolity 1 
N— Benzoyl adenosyl 

N-Benzoylcy tidy ly 1 
Tymidylyl 

o-Chloro phenyl phosphate 
Benzenesulfonic add 
Trie thy lamina 



35 



40 



Other blocks (1-6, and 8-17) constituting endorphin type genes can be synthesized by similar way. Each 
45 yield is given as hereunder. 



45 



50 



55 



60 



65 



Blocks 


Base Sequences 


Yield 




1 


CAGG 


77 




2 


GCTG 


94 


50 


3 


TAGG 


70 




4 


TTAA 


104 




5 


TGAC 


93 




6 


TTGG 


67 




7 


AAACTCCA 


70 


55 


8 


GAAGTCTC 


62 




9 


ACTTCTGA 


65 




10 


GTTTCATG 


65 




11 


CTACGGTG 


70 




12 


GCCCGTCA 


65 


60 


13 


ACCTGCA 


65 




14 


GTTTCTTG 


70 




15 


GTACGGTG 


67 




16 


GCCAT 


75 




17 


TTTG 


95 


65 
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(2) Endorphins genes synthesis: 

alpha-Endorphin gene (deoxy 80 mer) containing restricted enzyme sites was synthesized by a solid phase 
method as follows. 
1, Deoxytymidine CPG resin is washed with CH 2 CI 2 /MeOH. 
5 2. Detritylation is conducted with 2% BSA/CH 2 CI 2 (this was conducted repeatedly and promptly until 5 
colorization disappears) 
3. Subjected to azeotropic drying after substituted with pyridine. 
A solution of each block is added, subjected to azeotropic drying, and MSNT and pyridine for the 
reaction are added. Allowed to stand at room temperature and washed with pyridine. 
10 4. 0.1M Dimethylaminopyridine/pyridine solution and acetic anhydride are added, allowed to stand at < 10 
room temperature, and washed with pyridine. 
The above operation is conducted repeatedly, for 13 times in total. Average yield of this reaction was 84%. 
Then the resin is deprotected, at room temperature, with a solution of 0.1 M tetramethylguanfdine-pyridine 
aldoxime {cf. C.B. Reese, et al : Tetrahedron Lett, 2727, 1978) in dioxane-water, then washed with 
15 pyridine-water, the washing is concentrated in vacuo, concentrated ammonia water is added thereto, and 1 5 
the mixture is warmed. Ammonia is evaporated therefrom and a part of the residue is taken using 
dimethyoxytrityl group as a target to calculate the yield of the final stage. 

The residual reaction solution is subjected to a reversed phase (C 18 silica gel for Prep 500 manufactured by 
Waters), ion exchange (DEAE-toy opal), and reversed phase (C 18 silica gel, TSK-Gel 10-20 micrometers) open 
20 chromatographies to afford pure alpha-endorphin gene (containing restricted enzyme sites) (- deoxy 80 20 
mer). 

Purity was confirmed by HPLC (Nucleosil 300-7 C 18 ) and by electrophoresis and its base sequences were 
confirmed by Maxam-Gilbert method. The result is given in Rgure 1 to Rgure 3. 

Similarly prepared were alpha-(Leu 5 )-endorphin gene (containing restrictive enzyme site) (deoxy 77 mer), 
25 gamma-(Leu 5 )-endorphin gene (containing restrictive enzyme site) deoxy 77 mer) arid gamma-endorphin 25 
gene (containing restrictive enzyme site) (deoxy 80 mer). 

(3) Synthesis of duplet DNA and its combination with vector plasmid. 

Each one mole of deoxy 80 mer and synthetic nucleotide primer which is complimentary with 3'-terminal 
30 of the former were mixed, heated at 65°C, and cooled to room temperature to anneal the deoxy 80 mer and 30 
the primer. Then £ coii polymerase I (Klenow fragment) was added by conventional may and made to react 
at 37°C for 30 minutes so that DNA was made into double stranded. 

DNA was recovered as a precipitate In ethanol, made to react at 37°C for 30 minutes using T 4 
polynucleotidekinase, and both 5'-terminals of the double stranded DNA were phosphoryiated. 
35 Then the vector plasmid pUC 8 DNA was scissored with a restrictive enzyme Pst 1 , added to the above 35 
double stranded DNA solution, made to react at 16°C overnight with T 4 DNA ligase, and the double stranded 
d 80 mer DNA was com bined with the vector plasmid. 

(4) Cloning of plasmid containing endorphin gene. 

40 The plasmid prepared as above was transformed into £ cof/JWi 1 03 strain by conventional way, then 40 
selected using a deficiency of beta-galactosidase activity present in the pUC 8 as a target, and plasmid 
molecules were collected by cloning from the strain. 

It has been confirmed that plasmid in which endorphin gene was inserted into the correct orientation and 
position as desired in accordance with Maxam-Gilbert method. 

45 45 

(5) Obtaining of endorphins. 

Transformed £ co//JM 103 strain was precultured overnight in an LB medium, planted in 2YT medium, 
and subjected to a shake culture at 37°C. 

IPTG was added to the logarithmic productive phase stages (initial, medium and final stages) to make it 
50 0.5mM and synthesis of endorphin was induced. After being induced by IPTG, fused protein was extracted, 50 
and analyzed by HPLC whereupon it was found that adequate quantity of protein production was observed 
(1-5.0 x 10 5 molecules per cell) when induction was applied atthe initial stage of logarithmic productive 
phase. 

With reference to natural type alpha-endorphin and gamma-endorphin hacing methionine residue in a 
55 molecule, they were treated with trypsin by conventional way. With reference to alpha-(Leu s )-endorphin and 55 
gamma-(Leu s )-endorphin having leucine residue in place of methionine, they were treated with BrCN. 
Anyway, each f desired endorphin proteins desired was subjected t ac lumnchromat graphy according 
t th general purificati n method f proteins whereupon each of them was separat dandpurifi d. 
The fact that each of the resulting endorphin molecules exhibits desired amino acid sequence was 
60 confirmed by the fact that they were identical with the samples already btained by the peptide synthesis by 60 
testing with HPLC using a revers phase carri r. 



4. Brief Explanation of Drawings: 
Figure 1 is X-ray autoradi gram showing th result of 20% polyacryamide lectr ph resis fde xy80mer 
65 containing alpha-endorphin gen synthesized after determinati n by Maxam-Gilbert method. 65 
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Rgure 2 is X-ray autoradiogram sh wing the result of 8% p tyacrytamide electrophoresis of deoxy 80 mer 
containing alpha-endorphin gene synthesized after determination by Maxam-GIIbert method. 

Rgure 3 shows the result of high performance liquid chromatography (Nucleosil 300-7 C 18 ) of deoxy 80 
mer containing alpah-endorphin gene synthesized after determination by Maxam-Gilbert method. Ordinate 
5 and abscissa show absorbancy and time, respectively. Solvent system used was triethylamlne acetate- 5 
acetonitrile and the flowing speed was 1.0 ml/min. 

CLAIMS 

j 0 1. A method of synthesizing long chain DNA, characterised in that, blocks having 4 to 8 base sequences 10 
are purely chemically ligated by a so-called solid phase method (triester method) using aminated controlled 
pore glass as a carrier. 

2. A method as claimed in Claim 1 in which the blocks having 4 to 8 base sequences are subject to 
condensation reaction. 

15 3. A method of synthesizing long chain DNAand which is substantially as described herein. 15 . 
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